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Abstract — We present a decoding algorithm for quantum 
convolutional codes that finds the class of degenerate errors with 
the largest probability conditioned on a given error syndrome. 
The algorithm runs in time linear with the number of qubits. 
Previous decoding algorithms for quantum convolutional codes 
optimized the probability over individual errors instead of classes 
of degenerate errors. Using Monte Carlo simulations, we show 
that this modification to the decoding algorithm results in a 
significantly lower block error rate. 

Index Terms — Convolutional-codes, Quantum error correction, 
Viterbi algorithm 

I. Introduction 

The stabiHzer formaHsm yj for quantum error correc- 
tion reveals many similarities between classical and quantum 
codes. In particular, it shows that quantum codes can be 
obtained from classical self-dual codes over F4 [2], and inherit 
many of their properties. Despite this connection, there exists 
fundamental distinctions between classical and quantum codes, 
among which degeneracy stands out |i3J, [SJ. Whereas two 
distinct bit-flip patterns e and e' applied to a given bit string 
s always produces two distinct strings s + e and s + e', it is 
possible in quantum mechanics to find two distinct errors E 
and E' that have exactly the same effect on every code state, 
i.e. E\ip) = E'\ip) for every state lip) in the code space 
We say that errors E and E' are degenerate for the code 
In fact, every quantum code has degenerate errors, but we say 
that a code is degenerate when it has degenerate errors that 
are typical]^ 

Degeneracy is a purely quantum mechanical effect with no 
classical counterpart, and leads to important distinctions be- 
tween classical and quantum information theory. In particular. 
Shannon's bound to the capacity of a memoryless channel -yV 

C(^) < max/(X : .yK{X)) (1) 

is established by counting the number channel outputs that 
are produced by applying typical errors on all possible input 
codewords, and demanding that it does not exceed the total 
number of strings. If we apply the same counting argument to a 
degenerate quantum code, because some errors have the same 
effect on all codewords, we overestimate the number of distinct 
channel outputs and hence underestimate the channel capacity. 
Indeed, the quantum analog of the mutual information I{X : 
,yV{X)) for an input distribution X and channel ,yV is the 
coherent information ® ,yV{4)AB)) for a purification (t)AB 
of an input distribution 0^ ™d a quantum channel . Based 

'Thus, the notion of degeneracy makes implicit reference to an error model. 



on this analogy, one would expect that the quantity 

Qi(,_yr) =maxr(/®^(0^B)) (2) 

upper bounds the quantum channel capacity. But due to the 
existence of degenerate errors, the true quantum capacity is 
instead given by regularized quantity IS, 111, 111 

Q(^) = lim -maxr(/®^^"(0^.B.)). (3) 

n->oo n 4>Bn 

The quantity Qi can be achieved by non-degenerate codes — 
for Pauli noise defined below it is achieved by random 
stabilizer codes similar to those used in Shannon's original 
construction in the classical setting. However, Q requires the 
use of degenerate codes. Because Q involves an optimization 
over an input distribution correlated across many channel uses, 
it cannot be computed in general. However, degenerate codes 
of finite rate have been tailored for channels with Qi — |3|, 
El, 0, a striking demonstration that Q can be greater than 

Qi- 

To benefit from the degeneracy of a quantum code, it is 
necessary to take it into consideration in the decoding process. 
Maximum a posteriori (MAP) decoding usually consists in 
identifying the error with largest probability conditioned on 
a given error syndrome. But since degenerate errors have 
the same effect on all code states, they can all be corrected 
the same way. Hence, for a degenerate quantum code, MAP 
decoding should instead consist of identifying the class of 
degenerate errors with the largest probability conditioned on 
a given error syndrome, the probability of a class of errors 
being the sum of the probabilities of its elements. This 
can be substantially more complicated than standard MAP 
decoding. Topological codes ||9l, ifTOl are a good example of 
codes for which a standard MAP decoder exists f9l but no 
degenerate MAP decoder is known, and where we know that 
degeneracy can provide an advantage ifTTl . Thus, the design of 
degenerate MAP decoding algorithms is an important problem 
for quantum information theory. 

In this article, we study this problem for quantum con- 
volutional codes. MAP decoding of classical convolutional 
codes can be formally classified as a MAX-PROD problem, 
and is solved using Viterbi's algorithm lfT2l . The MAX is to 
optimize the conditional probability over all errors, while the 
PROD reflects the fact that the probability of a given error 
on a memoryless channel is the product of the probability of 
each of its components. Viterbi's algorithm makes use of the 
distributive law maxr^ y x ■ y = max^ x ■ maxy y to solve the 
MAP decoding problem ifTSl in a time that scales linearly with 
the length of the code. Using the stabilizer formalism, all this 



machinery can be imported to the quantum realm and yields 
a MAP decoder for quantum convolutional codes [14], [15] . 

A degenerate MAP decoder has a different formal structure 
however; the need to add-up the probabilities of all degenerate 
errors entails a MAX-SUM-PROD problem. By making use 
of the distributive law of the product over the sum and of the 
max over the product, together with the particular factorized 
structure of the problem, we conceive a generalization of 
Viterbi's algorithm that exactly achieves degenerate MAP 
decoding of quantum convolutional code in linear time. Using 
Monte Carlo simulations, we find that the degenerate decoder 
suppresses more errors than the standard non-degenerate de- 
coder; the improvement becoming more important at low error 
rates. Thus, convolutional codes provide an example where 
degenerate MAP decoding can be performed efficiently and 
yield a significant performance gain. 

The rest of this article is organized as follows. The next 
section introduces basic concepts. In Sec. 



Ill we define the 



decoding problem for quantum codes, emphasizing on the 
role of degenerate errors. Section |IV] summarizes Viterbi's 
algorithm for quantum convolutional codes, and explains how 
it can be modified to take degeneracy into consideration. 
Lastly, we present numerical result in Sec.|V] Our presentation 
of quantum convolutional codes and their decoding algorithms 
follows ifTSll . we refer the reader to this article for more 
technical details on these basic concepts. 

II. Definitions 

A. Stabilizer codes 

A quantum state of n qubits is specified by a vector \ip) in a 
2"-dimensional vector space J^. A quantum code encoding 
k qubits into n qubits is a 2 '^-dimensional subspace of Jif. 
The stabilizer formalism offers a compact description of such 
a subspace, making use of the Pauli group. The Pauli matrices 
are defined as follows 



X 



(4) 



Together with the 2x2 identity matrix / and the imaginary 
unit i, they form a group under multiplication, the Pauli group 
^1. The ?i-qubit Pauli group is the rt-fold tensor product of 
the single-qubit Pauli group ^„ = An important fact 

about is that all of its elements either commute or anti- 
commute, i.e. for all P,Q e PQ = ±QP. Using the fact 
that ZX = lY, we find that {i, Xa, Za} is a generating set 
of where Za (Xa) stands for the Pauli operator Z (X) 
acting on qubit a, and is the identity elsewhere, i.e. 



Za^I(E)Ii 



.I®Z ® I ® I ( 



(5) 



A stabilizer group ,y is an Abelian subgroup of ^„ that 
does not contain the element — /. It can be specified by s < n 
independent stabilizer generators {Sa}a=i,....s- The quantum 
code associated to the stabilizer group is defined by a 
set of eigenvalue equations 



Note that the condition Sa\ip) = for aU generators Sa of 
is enough to ensure Eq. (|7]i. The dimension of the code 
is 2*^ with k — n — s, so we say that the code encodes k 
logical qubits into n physical qubits, so has rate - . Clearly, the 
stabilizer generators Sa play a role analogous to the linearly 
independent rows of the parity check matrix of a classical 
linear code. 

The centralizer C(^) of ^ — i.e. elements of ^„ that 
commute with all elements of y — are called Pauli codewords. 
Multiplying a code state \t/j) E by a Pauli codeword 
L G C(,y) produces another code state. This can be verified 
directly since S{L\tl!)) — L{S\ip)) — L\tlj) which shows that 
L\ip) is an eigenstate of eigenvalue +1 of all stabilizers S, 
where we have used the commutativity of L and S. More 
generally, all code states can be generated starting from an 
arbitrary fiducial code state lipo) and multiplying it by a logical 
operator L £ Alg(C(^)) in the algebra generated by the Pauli 
codewords. 

The encoding circuit is another method to specify a stabi- 
lizer code. The main advantages of this circuit-based definition 
is that all commutation constraints discussed above are auto- 
matically satisfied, and it is more suited for the definition of 
quantum convolutional codes. The Clifford group on n qubits 
is the normalizer of the Pauli group in C/(2"). In other words, 
Clifford transformations consist of unitary transformation U 
on n qubits for which U PW is a Pauli operator, for all Pauli 
operators P. Any Clifford transformation on n qubits can be 
decomposed into a quantum circuit composed of controlled- 
not gates, Hadamard gates H = ^(i -i)> ^"d phase gates 
R = (o i )■ Given an n-qubit Clifford transformation U, we 
can construct a stabilizer code with stabiUzer generators 



Sa^UZaU\ 



1. 



(7) 



and Pauli codewords 

Xa^UX,+aU\ Za = UZ,+aU\ a=\,...,k. (8) 

For later use, it is also convenient to define operators that we 
call pure errors as 



Ta = UXaU\ a = l,...,; 



(9) 



Since conjugation by a unitary matrix preserves the com- 
mutation relations, the following commutation relations are a 
straightforward consequence of definitions Eqs. |7](9| 



\Sa, •S'b] 
[Sa, Zb] 





[Sa, Xb] 







SaTb = {-l)'-'nSa 



(10) 

(11) 

(12) 



(6) 



where Eq. ( [TO] l expresses the fact that is Abelian, Eq. ( [TT| ) 
shows that Pauli codewords are in the center of and 
Eq. ( [T2] l will be helpful in formulating the decoding prob- 
lem. Also, because the operators {Za,Xa} generate and 
because completeness is not affected by a unitary trans- 
formation, it follows that {i, Za,Xa, Sb,Tb}a=l^...^k,b=l,...,s 

generate Lastly, U is called the encoding circuit because 
any state obtained from the following circuit 



u 



where \tp) can be an arbitrary fc-qubit state, is a code state, 
i.e. = [/(lO)*^* (g) IV^)) e In this circuit, we call the s 
top input qubits the ancillary qubits, while the bottom k input 
qubits are called the data qubits as they carry the information 
prior to encoding into 

B. Pauli noise 

The natural noise models to study using stabilizer codes 
are Pauli noise models. In these models, the noise alters an 
encoded state \tp) by multiplying it by an element of the Pauli 
group E e i.e. \'il>) — > The errors E are chosen 

at random according to some probability P{E) that specifies 
the noise model. A common assumption is that of an i.i.d. 
noise model where P{E) is the n-fold tensor product of a 
distribution (1 — — Py — Pz,Px,Py,Pz) over the single qubit 
Pauli grouprl The symmetric choice Px = Py = Pz = P 
called the depolarization channel of rate p. 

C. Quantum convolutional codes 

Quantum convolutional codes were introduced in lfT6l . ifTTl . 
but here we follow essentially the definitions of ifTSll . and refer 
the reader to this article for more technical definitions. An 
(n, k, ?7)-quantum convolutional code is a stabilizer code with 
stabilizer generators of the form 



{Sa,t 



Sa}a=l 



,s\t=l,...,T 



(13) 



where Sa S 3^n+ri and 77 is called the constraint length of 
the code. In other words, there are s distinct (n + ry)-qubit 
Pauli operators Sa that are translated by integer multiples 
of n to generate the entire set of generators. Note that the 
total number of qubits used in the code is left unspecified 
in this definition; the maximum value r of f determines the 
length of the convolutional code. Thus, it is implicitly assumed 
in Eq. ( [T3| ) that the operators are padded to the right with 
identity matrices so that they all act on N = nr + rj qubits. 
Of course, the Sa should all commute with one another to 
ensure [Sa,t, Sa' ,t\ — 0, and also with their translations by 
integer multiples of n to ensure [Sa.ti Sa' ,t'] — 0. For codes 
with large constraint length 77, this last commutation condition 
can lead to a large number of constraints on the Sa that are 
difficult to fulfill. But as for stabilizer codes discussed above, 
there exists an equivalent circuit-based definition of quantum 
convolutional codes that circumvents this difficulty. 

In this circuit-based definition, a (n, fc,m)-quantum convo- 
lutional code is a stabilizer code whose encoding circuit takes 
the particular form 

^Note that the absolute phase of the noise — e.g. whether error X, or iX, 
or —X was applied — has no observable effect, so our description of errors is 
over equivalent classes of Pauli operators defined modulo a phase. 
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which illustrates the case n = 5, fc = 2 and m — 1. The 
encoding circuit is composed of a seed transformation V, 
element of the Clifford group, that is applied periodically to 
non-disjoint sets of qubits. We refer to a given period of this 
circuit as a time frame of the code. The qubits that overlap 
two consecutive applications of the seed transformation are 
referred to as memory qubits, and there are in general ni of 
them in every frame. In each frame (except the very first one), 
there are a total of n input qubits, k of which carry quantum 
information (marked "Data" in the circuit) and s = n — k 
are ancillary qubits in the sate |0). The rate of the code is 
therefore -. A quantum convolutional code can contain an 
arbitrary number r of frames. For a given value of r, we 
obtain a code block containing a total of N — t x n physical 
qubits and encoding t x k logical qubits]^ 

A generating set for the stabilizer is given by Sa = UZaU^ 
where U is the -qubit Clifford transformation resulting from 
the entire sequence of seed transformation V, and the qubit 
index a varies over all ancillary qubit index, i.e. locations on 
the encoding circuit marked with qubits in a |0) input state. 

The notation used in the definition based on Eq. ([T3| and the 
definition based on the encoding circuit differ slightly because 
there is no direct relation between the constraint length 77 and 
the number of memory qubits m. However, one can easily 
show that a convolutional code defined by a circuit with m 
memory qubits admits a set of generators of constraint length 
T] < 4™. 

III. Degenerate decoding 

As in the classical setting, the first step in the decoding 
process is the error syndrome extraction. In quantum me- 
chanics, this is done by measuring the stabilizer generators. 
Remember that in quantum mechanics, the measurement of 
an observable — i.e. an Hermitian matrix — gives an outcome 
equal to the eigenvalue of the operator corresponding to the 
state of the system. By definition, all code states have +1 
eigenvalues for all stabilizer generators, i.e. Sa\ip) = If 
an error E corrupts the state to E\ip), the eigenvalue of Sa 
will remain the same when E and Sa commute, and will be 
changed to —1 when E and Sa anti commute. To see this, 
note that 

" ESaW_= E\Jj) _ if SaE = ESa 

-ESam = ~E\^) if SaE = -ESa. 



Sa{Em 



^Depending on the way the code is terminated, there can be a constant 
number of extra physical qubits in the block, which decreases the code's rate 
by a quantity that vanishes as ^, see 1 1 81 , [15j. 



The ±1 measurement outcome of Sa gives the ath syndrome 
bit Sa, and we denote the collection of s syndrome bits s = 

(Sl, S2, . . . Ss). 

The goal of the non-degenerate (ND) decoder can be stated 
quite simply: find the most probable error conditioned on the 
measured syndrome. If we denote s{E) — {si{E), . . . , Ss{E)) 
the syndrome that error E would produce, non-degenerate 
MAP decoding consists of the optimization problem 



^MAp(s) 



argmax P{E) 

E:s{E)=s 



(14) 



where P{E) is given by the noise model, and argmax denotes 
the argument that achieves the maximum. 

To explain the degenerate decoding problem, it is convenient 
to express PauU error s in a basis tailored to the stabilizer code. 
As explained in Sec. II-A the set {i, Za, Xa, Sb, TJ,} forms a 



basis for ^^„. Thus, an error E has a unique decomposition 
into a product of logical operators, elements of the stabilizer 
group, pure errors, and an irrelevant phase factor that we will 
henceforth ignore. In other words, we can uniquely decompose 
any error as £: = LST where L e C{y), S & ,9', and 
T e {To)- We can therefore interpret the noise model P{E) 
as a probability distribution over L, S, and T, simply setting 



P{L,S, T) = P{E = LST). 



(15) 



Observe that the syndrome s is in one-to-one correspon- 
dence with the pure error component T, namely, an error with 
syndrome s has the pure error component 



0/2 



(16) 



P{E\s)^P{L,S\T{s))^ 



(17) 



where = I and = Ta- This can be seen from the 
commutation relation Eq. ( [T2] l, which shows that Sa anti- 
commutes with E if and only if E contains Ta when decom- 
posed in the basis {Za, 5*6, ?],}. In addition, Eqs. lo][TT 
show that the S and L components of an error E have no 
effect on its syndrome. Thus, knowledge of the error syndrome 
is equivalent to knowledge of T, so the error probability 
conditioned on the error syndrome is obtained by Bayes' rule 

P {L, S,T{s)) 
P(T(s)) 

where the marginal probability is defined as usual P{T{s)) ~ 
El,sP(L,S,T{s)). 

Degenerate errors are those that differ only by their S 
component. Indeed, the errors E and E' = ES for S £ -J^ 
have exactly the same effect on all code states by definition. 
Thus, only the logical component L needs to be identified 
to correct the error; the T component is known given the 
syndrome and the S component has no effect on the encoded 
information. The degenerate MAP decoding therefore consists 
of the optimization problem 



-^MAp(s) 



argmaxP(L|r(s)), 

L 



(18) 



where P{L\T{s)) = J^s ^(■^' '^'[^(s)) is the marginal condi- 
tional distribution obtained from Eq. ([TtJ. 
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Fig. 1. Decomposition of an error E into a tensor product of errors Et 
in eacli n-qubit frame. The error produces a syndrome s and an un-encoded 
logical component L, that are also broken-up into components on each frame 
St and Lt- 



An equivalent way to explain the degenerate decoding 
problem is to imagine un-encoding the corrupted encoded 
state E\i})). This produces the state E\ip) = WEU{\'4>) ® 
10)^'*) = QIO)'^'* where L ® I'^^ = iPZU is the 

un-encoded version of the logical component L of E and 
I®^ ®Q — WTSU is the un-encoded version of the stabilizer 
S and pure error T components of E. Note that the un-encoded 
version of S contains only Z operators on the ancillary qubits 
by definition, c.f. Eq. (|7]i. Since Z\Q) = |0), we conclude that 
the S component has no effect on the un-encoded corrupted 
state, a direct manifestation of degeneracy. The T component 
on the other hand map to X operators under W , c.f. Eq. 
We conclude that the final state is W E\ip) = L\ip) ® \s) 
where |s) is a shorthand for a s-qubit state with qubit a in a 
state \—. 



-). The error syndrome can be directly obtained by 
measuring the ancillary qubits, and the degenerate decoding 
problem consists in identifying the most likely L given s. 

IV. Degenerate Viterbi algorithm 

With convolutional code, it is natural to use a notation 
that reflects the periodic structure of the encoding circuit. 
We use a two-index notation to identify qubits: qubit {t, a) 
corresponds to the ath qubit of the ith time frame, in other 
words the [{t — l)n + a]th qubit altogether since there are n 
qubit per time frame. Similarly, the error syndrome s naturally 
breaks into r distinct s-bit syndromes s — (si, . . . , s,-) with 
s* = • ■ • St.s)- Following the last paragraph of the 

previous section, syndrome bit ^ is obtained by measuring 
the ancillary qubit (i, a) after having un-encoded the corrupted 
state, see Figure [T] Similarly, it is natural to also break L and 
E into a product of operators on each frame Lt and Et, see 
Figure [T] Since we assume that the quantum channel is mem- 
oryless, the error probability factors as P{E) = Yit^i-^t)- 

The standard, non-degenerate, decoding of quantum convo- 
lutional code uses a trellis. Trellis-based decoding of quantum 
convolutional codes was introduced in |14|, but here we 
follow the presentation of ifTSll . and refer the reader to this 
article for more details of the construction. The trellis for an 
{n, k, TO)-quantum convolutional code of length r is a directed 
multigraph whose vertices can be grouped into t sets At - Each 
element of Af is labeled by a distinct element Mt € ^m. 



so I At I — 4™. Given an error syndrome s, there is an edge 
between Mt-\ E At_i and Mt G At if and only if there exists 
an Lt e ^fe, Zt e (/, Z)^ and Et € ^n-m such that 



V{Mt-i <E} ZtX{st) <E} Lt)V^ =Et® Mt, 



(19) 



where X{st) = JlLi There can in general be 

different choices of Et that fulfill this condition, and we label 
the multiple edges by the associate Et- This condition is 
illustrated by the circuit 



Mt-i<- 
ZtXist){— 
Lti- 



vm 



Et 

yMt 



which can be understood, reading from right to left, as one 
segment of the un-encoding circuit which proceeds recursively 
starting at t = r and making its way to i = 1. The interpre- 
tation is a situation where errors in the previous un-encoding 
steps have resulted in a memory state Mt which, combined 
with the error segment Et, produces the right syndrome bits 
St for the time frame t, modifies the logical state by the 
application of Lt in this frame, and sets the memory in a 
new state Mt-i- 

With this interpretation we see that paths in the trellis 
correspond to all the distinct errors E with syndrome s. To 
find the most likely error (or path) among them, we associate 
a probability (or weight) to each edge. The probability asso- 
ciated to the edge linking Mt^i to Mt labeled Et is P{Et), 
the probability of the corresponding error, specified by the 
error model. The non-degenerate MAP decoding then consists 
in finding the path in the trellis with the largest probability, 
where the probability of a path is equal to the product of 
the probability of each of its segments. It is convenient to 
speak instead of the length of a path, which is equal to the 
sum of the weight of its segments, the weight of segment 
w[Mt-i,Mt;Et) labeled Et being -\nP{Et), and we set 
w{Mt-i,Mt; Et) = oo if there is no edge labeled Et between 
Mt-i and Mt- Then, the decoding problem becomes that of 
finding the shortest path in the trellis. 

Viterbi's algorithm solves this problem recursively, starting 
at i = T and decreasing the value of i by 1 at each iteration, by 
associating a cumulative distance to the vertices of the graph. 
The cumulative distance of vertex Mt, d{Mt), is equal to the 
length of the shortest path starting ?At — t and leading to Mt- 
It obeys the recursive equation 



d{Mt-i) = min [d{Mt) + w{Mt-i,Mt;Et)], 

Mt ,Et 



(20) 



so the algorithm keeps only the edges that realize the maxi- 
mum for each A/t-i, the other edges are erased. The initializa- 
tion of the algorithm is given by d{Mr) — — In P(Mt-) where 
P{Mr) is the error model on the last m bits of the circuit. To 
explain the termination of the algorithm, we must specified that 
the first m qubits of the quantum circuit (the memory qubits 
of frame 0) are used as additional ancillary qubits. Thus, they 
are initialized in state lO)**™ prior to encoding and measured 



in the Z basis after the un-encoding, see ifTSl for more details. 
The output of this measurement reveals the X component of 
A/q, and only the values of Mq with the correct component 
are kept. The shortest path is the one connecting to the Mq 
with the shortest cummulative distance d{Mo). 

The reason why the previous algorithm does not take 
degeneracy into account is that each path is associate to a 
distinct physical error For degenerate decoding, errors need 
to be associated to equivalent classes of states instead, where 
errors that differ by an element of ^ are joined in the same 
class. One way to keep track of such classes is to label them 
by their logical un-encoded component L = Li ® . . . ® Lr- 
The degenerate Viterbi algorithm will thus use the same 
trellis structure as above, but instead of labeling an edge that 
fulfills Eq. ([19]) by the corresponding Et, it will be labeled 
by the corresponding un-encoded logical operator segments 
Lt G ,^k- As before, the probability associated to an edge 
will be equal to P{Et) for the Et fulfilling the condition 
Eq. ( [T9| ). Thus, at this point, the construction of the trellis 
follows exactly the original prescription of ||T4| . but its edges 
are labeled differently. 

For a given syndrome segment Sj, un-encoded logical 
segment Lt, and memory states Mt^i and Mt, there can 
be multiple distinct Et that fulfills the condition Eq. ([T9|. 
Call this set of solutions il{Mt-i, Mt, Lt,St)- Thus, the 
trellis contains multiple edges between some memory states 
Mt-i and Mt with the same label Lt- These multiple edges 
il{Mt-i, Mt, Lt, St) correspond to degenerate paths, i.e. paths 
that differ only by an element of the stabilizer group. There- 
fore, they should be viewed as equivalent paths so we merge 
them into a single super edge as illustrated in Fig. |2] The 
probability of a super edge is equal to the sum of the 
probabilities of the merged edges, or in terms of weight: 



wiMt-i,Mt;Lt) = - In 



E 



P{Et) 



(21) 

In terms of these weights, the degenerate decoding problem 
becomes the problem of finding the shortest path in the trellis, 
for which Viterbi's algorithm can be used. 

In summary, our modification to Viterbi's decoding algo- 
rithm consists in a pre-processing phase that modifies the 
trellis my merging edges that correspond to degenerate errors 
and computing the associated weights. Given this modified 
trellis, the degenerate decoding problem becomes identical to 
the non-degenerate decoding problem, and can be solved with 
Viterbi's original algorithm. 

V. Results 

We have benchmarked our degenerate Viterbi decoder using 
Monte Carlo simulations on quantum convolution codes of 
finite block length r — 600 with various code parameters 
(n,k,m), results are presented in Fig. [s] The codes were 
generated by choosing the seed transformation V at random 
in the Clifford group. For each set of parameters (n, k, m), we 
have simulated a few dozens of such randomly generated codes 





Fig. 2. Merging procedure. Multiple edges with the same label linking Mt-\ 
to Mt are merged into a unique super edge with the corresponding label. The 
probability of a super edge is equal to the sum of the probabilities of the 
merged edges. 



and kept the most interesting results, although our conclusions 
extend qualitatively to all codes we have simulated. The codes 
are terminated by padding, as explained in |15|. 

The Monte Carlo simulations were realized as follows. 
An N qubit error E is generated randomly according the 



depolarizing error probability of rate p (c.f Sec. II-B i and the 
corresponding error syndrome s is calculated. The syndrome 
is fed as input to two distinct algorithms: a degenerate and a 
non-degenerate decoding algorithm. The degenerate decoder 
outputs the most likely logical error component i^^p(s) 
given by Eq. ([TSj. It is declared successful if the output 
^MAp(^) belongs to the equivalence class of the randomly 
generated error E, and failed otherwise. The procedure is 
repeated A^sampio times to accumulate statistics, and the block 
error rate equals the frequency of its failures. The number of 
samples iVsampic is adjusted such that at least 30 significant 
events are observed, resulting in a relative error at most l/\/30 
on the reported data. Figure [3] reports the block error rate of 
the degenerate decoder as a function of the channel error rate 
for different convolutional codes. 

A similar procedure is performed in parallel with the non- 
degenerate Viterbi decoder during our Monte Carlo simula- 
tions. Given the input syndrome s, the non-degenerate decoder 
outputs the most likely error £'^^p(s) given by Eq. ( [T4] i. 
It is declared successful if the output _E^^p(s) belongs to 
the equivalence class of the randomly generated error E, 
and failed otherwise. Note that the non-degenerate decoder 
therefore benefit from the existence of degeneracy because 
its output E^^-p{s) is not required to exactly match the 
randomly generated error E to be accepted as a successful 
decoding, as long as they belong to the same equivalence class. 
The distinction is that degeneracy is not explicitly taken into 
account to estimate the error 

To evaluate the impact of degeneracy on the decoder's 
performance, we compare its block error rate with the one 
obtained with the non-degenerate Viterbi decoder In the inset 
of Fig. |3] we show the ratio between the non-degenerate 
decoder's block error rate to the one of the degenerate decoder 
The fact that all data points are above 1 indicates that the 
non-degenerate decoder always yields a lower block error rate 
than the non-degenerate decoder We also observe in the inset 
of Fig. |3] that the benefit becomes more prominent at low 
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Fig. 3. Block error rate as a function of the channel depolarizing rate p 
for different (n, k, ni)-quantum convolutional codes: circles (4, 1,1); squares 
(4, 1, 2); triangles (5, 1,3). Since all codes have fc = 1 and the length of the 
code is T = 600, all codes encode 600 logical qubits. The triangles represent 
a code of lower rate, so as expected it shows overall better performances. 
Inset: the ratio of block error rates obtained with the non-degenerate Viterbi 
decoder and the degenerate Viterbi decoder. The degenerate Viterbi decoder 
always yields a lower block error rate, and this advantage increases as the 
channel eiTor rate decreases. 



depolarizing rates. This effect appears to be independent of 
the code parameters: the block error rate ratios of all the 
codes we have simulated show a clear monotonic increase as 
p decreases. We have observed gains as large as 4.4 dB. 

VI. Conclusion 

Degeneracy is a key feature of quantum codes that is at the 
origin of important distinctions between classical and quantum 
information theory. To gain from the existence of degeneracy, 
it must be taken into account during the decoding process 
of a code. Here, we have developed a decoding algorithm for 
quantum convolutional codes that exactly solves the maximum 
a posteriori decoding problem over equivalent classes of de- 
generate errors. The main modification to the standard Viterbi 
algorithm is a pre-processing phase that modifies the trellis. 
Our Monte Carlo simulations show that degenerate decoding 
improves the error suppression of the code, and that this effect 
becomes more prominent at low error rates. 

Because convolutional codes are the main ingredient of 
quantum turbo codes, a natural next step would be to use our 
degenerate decoder in a concatenated scheme, where its benefit 
could be further amplified. The development of a fault-tolerant 
computing scheme based on these codes is also desirable. The 
importance of our decoder may be particularly important in 
this setting since fault-tolerant quantum computers operate at 
low error rates, where the effect of degenerate decoding are 
most prominent. 
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